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Abstract 

We define and study a generalization of Sobol sensitivity indices for the case of a vector output. To cite this 
article: F. Gamboa, A. Janon, T. Kletn, A. Lagnoux, C. R. Acad. Sci. Paris, Ser. xx xxx (2013). 

Resume 

Indices de sensibilite pour sorties multivariees. Nous definissons et etudions une generalisation des indices 
de Sobol pour des sorties vectorielles. Pour citer cet arttcle : F. Gamboa, A. Janon, T. Klein, A. Lagnoux, G. R. 
Acad. Sci. Paris, Ser. xx xxx (2013). 



1. Introduction 

Many mathematical models encountered in applied sciences involve a large number of poorly-known 
parameters as inputs. It is important for the practitioner to assess the impact of this uncertainty on 
the model output. An aspect of this assessment is sensitivity analysis, which aims to identify the most 
sensitive parameters. In other words, parameters that have the largest influence on the output. In global 
stochastic sensitivity analysis, the input variables are assumed to be independent random variables. Their 
probability distributions account for the practitioner's belief about the input uncertainty. This turns the 
model output into a random variable. 

When the output is scalar, using the so-called Hoeffding decomposition [4], its total variance can be split 
down into different partial variances. Each of these partial variances measures the uncertainty on the 
output induced by the corresponding input variable. By considering the ratio of each partial variance to 
the total variance, we obtain a measure of importance for each input variable called the Sobol index or 
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sensitivity index of the variable [3]; the most sensitive parameters can then be identified and ranked as 

the parameters with the largest Sobol indices. 

Generalization of the Sobol index for multivariate (vector) outputs has been considered in [2] in an 
empirical way. In this note, we consider and study a new generalization of Sobol indices for vector 
outputs. These indices stem from an Hoeffding decomposition and satisfy natural invariance properties. 
In this note, we define the new sensitivity indices, examine some of their properties and show why they 
are natural. We also study a Monte-Carlo estimator of these indices, as in practice the exact values are 
not explicitly computable. 



2. Definitions and Properties 

2.1. Preliminaries 

Wc denote by Xi, . . . , Xp the input random variables defined on some probability space P), and by Y 
the output: Y = f{Xi, . . . , Xp), where f : Rp ^ R'^ {p, k are integers). We suppose that Xi, . . . , Xp are 
independent, that Y e i^(0,R*^), and that the covariance matrix of Y is positive definite. 
For any non-empty r-subset u of {1, . . . we set X^ = {Xf, i G u) and X^^ = (Xj, i e {1, . . . ,p} \ u). 

2.2. Definition of the generalized Sobol indices 
We recall the Hoefl[ding decomposition of / [4]: 

/(Xi, ...,Xp) = C + /u(X„) + /^u(^^u) + /u,~u(^u,^^u), (1) 

where c e M'=, /„ : M'' -S- /^u : R^"'' ^ K*' and /u,^u : M^' ^ M*' are given by: 

c = E{Y), /„ = E(y|Xu) - c, /^u = ]E(F|X^u) - c, /„,^u = Y-U-U^-c. 
Taking the covariance matrices of both sides of (1) gives (thanks to i^-orthogonality): 

S = a + C^u + Cu,^u, (2) 
where S, Cu, C^u and Cu,~u are, respectively, the covariance matrices of Y, /u(Xu), /^u(-^'-u) and 

/u,~u(Xu, Xr^n). 

For scalar outputs (ie., when k = 1), the covariance matrices are scalar (variances), and (2) is interpreted 
as the decomposition of the total variance of y as a sum of the variance caused by the variation of the 
input factors Xj for i G u, the variance caused by the input factors not in u, and the variance caused by 
the interactions of the factors in u and those not in u. The (univariate) closed Sobol index 5u>Scai^ ^-^ _ 
is then interpreted as the sensibility of Y to the inputs in u. Due to noncommutativity of the matrix 
product, a direct generalization of this index is not straightforward. 

We now go back to the general case. For any k x k matrix M, (2) can be projected on a scalar by 
multiplying by M and taking the trace: 

Tr(MS) = Tr(MCu) + Tr(MC^„) + Tr(MCu,^u). 

This equation is the natural scalarization of the matricial identity (2) (as, for a symmetric matrix V, we 
have j MijVij = Ti{MV)). This suggests to define, when Tr(MS) ^ 0: 

5u(M;/) = ^^ 
^ '■'^ Tr(ME) 
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as the M-sensitivity measure (sensitivity index, or generalized Sobol index) of Y to the inputs in u. We 
can also analogously define: 5~"(Af;/) = S'"''""(M; /) , which measures the 

sensitivity to, respectively, the inputs not in u, and to the interaction between inputs of u and inputs of 
{1, . . . ,p} \ u. The following lemma is obvious: 

Lemma 2.1 (i) The generalized sensitivity measures sum up to 1: 

S^{M- /) + 5~"(M; /) + 5"'-"(M; /) = 1. (3) 
(a) Left- composing f by a linear operator O ofM.'^ changes the sensitivity measure according to: 

(tii) For k = l, and for any M ^ 0, we have ^"(M; /) = 



2.3. The case M = Idk 



We now consider the special case M = Idk (the identity matrix of dimension k). We set Su{f) = 

5"(Idfc;/). The index S'u(/) has the following properties: 

Proposition 2.2 Suppose that Y e L'^{fl,R'') and that E is positive-definite: 

fi) < SM) < 1; 

(a) Su{f ) is invariant by left- composition of f by any isometry ofR'^, i.e. 

yOkxk matrix s.t. 0*0 = Idk, S^{Of) = 5u(/); 
(Hi) Su{f) is invariant by left-composition of f by any nonzero homothety ofR''. 

Proof: Point (i): positivity is clear, as Cu and E are positive; <S'u(/) < 1 follows from positivity and (3). 
For (ii), we use (4). Point (iii) is immediate. □ 

The properties in the Proposition above are natural requirements for a sensitivity measure (the isom- 
etry invariance property ensures that the resulting indices are "intrinsic" and docis not depend on the 
parametrization of the output). Note that <§„(/) is the sum of the partial variances divided by the sum of 
the total variances of each output coordinate, and the covariances between coordinates are not involved. 
In the next section, we will show that these requirements can be fulfilled by S^{M; •) iff M = Aldfe for 
A € M*. Hence the only "canonical" sensitivity measure is ^u- 



3. M = Idfe is the only good choice 

The following Proposition can be seen as a kind of converse of Proposition 2.2. 

Proposition 3.1 Let M be a square k x k matrix such that Tt{MV) ^ for any symmetric positive- 
definite matrix V. Now if for all f : MP ^ Ef', and all subsets u C {1, . . . ,p}, we have that S^{M; f) is 
invariant by left- composition of f by any isometry ofR'^, then S^{M;-) = <S'u(-)- 

Proof: Let M as in the Proposition. We can write M = Msym + MAntisym where Mg^^ = Msym and 
^Antisym = -^Antisym- Since, for any symmetric matrix V, we have TT:{MAnUsymy) = 0, we have 
S^{M; f) = S^{Msym'i f) and we can assume, without loss of generality, that M is symmetric. 
We diagonalize M in an orthonormcd basis: M = PDP*, where P*P = Idk and D diagonal. We have: 

S^(M- f) - ^'(PDP'^^) TrjDP^C^P) _ 
* Tr(PI)P*E) - Tr(DP*EP) 

This shows that M can in fact be assumed diagonal. 
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Now we want to show that M = Aldfc for some A e K*. Suppose, by contradiction, that M has two 
different diagonal coefficients Ai 7^ A2. It is clearly sufficient to consider the case k = 2. Choose / = Id2 
(hence, p = 2), and u = {1}. We have E = Id2, and Cu = ( )' hence on the one hand S^{M; /) = x^^- 
On the other hand, let O be the isometry which exchanges the two vectors of the canonical basis of M^. 
We have S'"(Af; Of) = and invariancc by isometry is contradicted if Ai 7^ A2. We also have A 7^ 

since Tr(M) ^ 0. Finally,' it is easy to check that, for any A e K*, ^"(Aldft; •) = ^"(Idfe; •) = S^. □ 



4. Estimation of 

In general, the covariancc matrices Cu and S arc not analytically available. In the scalar case {k = 1), it 
is customary to estimate S'"'^'^'^'(/) by using a Monte-Carlo pick-freeze method [3, 1], which uses a finite 
sample of evaluations of /. In this Section, we propose a pick- freeze estimator for the vector case which 
generalizes the estimator studied in [1]. We set: = f{Xu,X'^^) where is an independent copy 
of Xr^u- Let N be an integer. We take N independent copies Yi, . . . , (resp. F", . . . , Y^) of Y (resp. 
y"). For I = 1, . . . ,k, and i = 1, . . . , A^, we also denote by Yi^i (resp. i^") the l^^^ component of Yi (resp. 
yj"). We define the following estimator of Su{f)- 



SuMf) = 



2^1=1 I Z^i=l 2 N \2^i=l 2 J j 



Thanks to the simple form of this estimator, the following Proposition can be proved in a way similar to 
the one used to prove Proposition 2.2 and Proposition 2.5 of [1] (ie., by an application of the so-called 
Delta method). 

Proposition 4.1 Suppose Y e L*(f2, M*^), and that S is positive-definite. Then: 

(i) {Su,N{f)) pf is asymptotically normal: there exists a = cr(/) so that VN{Su,Nif) — Su{f)) converges 
(for N — > +00 ) in distribution to a centered Gaussian distribution with variance . 

(ii) (5u,Ar(/))jY is asymptotically efficient for estimating Su{f) among regular estimator sequences that 
are function of exchangeable pairs {Y, F"). 
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